Examining Talker and Phoneme Generalization of Dimension-Based Statistical Learning in Speech Perception

Authors

Kaori Idemaru

Lori L. Holt

Abstract

Speech perception flexibly adapts to short-term regularities of the ambient speech input. Recent research demonstrates that the function of an acoustic dimension for speech categorization at a given time is relative to its relationship to the evolving distribution of dimensional regularity across time, and not simply to its fixed value along the dimension. Two studies examine the nature of this dimension-based statistical learning in online word recognition, testing generalization of learning across talkers and across phonetic categories. The results indicate that dimension-based statistical learning is specific to the experienced regularities, resisting transfer across talkers or phonetic categories.

Similar resources

Visual phonemic ambiguity and speechreading.

PURPOSE To study the role of visual perception of phonemes in visual perception of sentences and words among normal-hearing individuals. METHOD Twenty-four normal-hearing adults identified consonants, words, and sentences, spoken by either a human or a synthetic talker. The synthetic talker was programmed with identical parameters within phoneme groups, hypothetically resulting in simplified ...

Acoustic differences, listener expectations, and the perceptual accommodation of talker variability.

Two talkers' productions of the same phoneme may be quite different acoustically, whereas their productions of different speech sounds may be virtually identical. Despite this lack of invariance in the relationship between the speech signal and linguistic categories, listeners experience phonetic constancy across a wide range of talkers, speaking styles, linguistic contexts, and acoustic enviro...

Auditory-visual integration of talker gender in vowel perception

The experiments reported here used auditory-visual mismatches to compare three approaches to speaker normalization in speech perception: radical invariance, vocal tract normalization, and talker normalization. In contrast to the first two, the talker normalization theory assumes that listeners' subjective, abstract impressions of talkers play a role in speech perception. Experiment 1 found that ...

Consonant confusion structure based on machine classification of visual features in continuous speech

This study is a first step in selecting an appropriate subword unit representation to synthesize highly intelligible 3D talking faces. Consonant confusions were obtained with optic features from a 320-sentence database, spoken by a male talker, using Gaussian mixture models and maximum a posteriori classification methods. The results were compared to consonant confusions obtained from visual-on...

Lexically guided phonetic retuning of foreign-accented speech and its generalization.

Listeners use lexical knowledge to retune phoneme categories. When hearing an ambiguous sound between /s/ and /f/ in lexically unambiguous contexts such as gira[s/f], listeners learn to interpret the sound as /f/ because gira[f] is a real word and gira[s] is not. Later, they apply this learning even in lexically ambiguous contexts (perceiving knife rather than nice). Although such retuning coul...

Publication date: 2012
